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The MAILING DATE of this communication appears on the cover sheet with the correspondence address 
Period for Reply 

A SHORTENED STATUTORY PERIOD FOR REPLY IS SET TO EXPIRE 3 MONTH(S) FROM 
THE MAILING DATE OF THIS COMMUNICATION. 

- Extensions of time may be available under the provisions of 37 CFR 1 .136(a). In no event, however, may a reply be timely filed 
after SIX (6) MONTHS from the mailing date of this communication. 

- If the period for reply specified above is less than thirty (30) days, a reply within the statutory minimum of thirty (30) days will be considered timely. 

- If NO period for reply is specified above, the maximum statutory period will apply and will expire SIX (6) MONTHS from the mailing date of this communication. 
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2a)D This action is FINAL. 2b)[3 This action is non-final. 

3) D Since this application is in condition for allowance except for formal matters, prosecution as to the merits is 

closed in accordance with the practice under Ex parte Quayle, 1935 CD. 1 1 , 453 O.G. 213. 
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5) \Z\ Claim(s) is/are allowed. 
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DETAILED ACTION 



Claim Objections 



1 . Claims 5 to 19, and 26 to 29 are objected to because of the following 
informalities: 

In claim 5, line 6, "said weighted combination" lacks antecedent basis. The term 
"weighted" was deleted from independent claim 1, upon which claim 5 depends. 
Appropriate correction is required. 

Claim Rejections - 35 USC § 102 

2. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public 
use or on sale in this country, more than one year prior to the date of application for patent in the United 



3. Claims 1 to 5, 21 to 25, 30 to 32, 37 to 39, 48 to 52, 68 to 73, 77 to 79, 84 to 86, 
90, 96 to 97, 99, and 101 are rejected under 35 U.S.C. 102(b) as being anticipated by 
Chou et a/. 

Regarding independent claims 1, 48, 97, 99, and 101, Chou etal. discloses an 
apparatus, method, and instructions, comprising: 



States. 
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"a receiver operable to receive an input signal" - an input of an unknown speech 
string 18 (an utterance) of words is received from a microphone (column 4, lines 34 to 
35: Figure 1); 

"a recognition processor operable to compare said input signal with stored label 
models to generate a recognized sequence of labels in said input signal and confidence 
data representative of the confidence that the recognized sequence of labels is 
representative of said input signal" - recognition processor 10 receives the input, 
accesses the recognition database 12, scores the unknown speech string of words 
against the recognition models in the recognition database 12, and generates a 
hypothesis string signal 20; verification processor 16 receives the hypothesis string, and 
generates a confidence measure signal 22 (column 4, lines 34 to 51: Figure 1); 

"a similarity measure calculator operable to compare said recognized sequence 
of labels with a stored sequence of labels using a combination of predetermined 
confusion data which defines confusability between different labels and said confidence 
data, to provide a measure of the similarity between the recognized sequence of labels 
and the stored sequence of labels" - confidence score computation ("a similarity 
measure calculator") for a speech segment q relates a comparison between a word 
model score ("said confidence data") and scores computed with the anti-word model 
("predetermined confusion data which defines confusability between different labels"); in 
Equation (2), L(O q ;0,l) is "the measure of similarity" calculated by the similarity measure 
calculator, gi(O q ) = log p(O q \Q[ k) ) is "the confidence data" for the keyword hypothesis 
{0i (k) }, and Gf(Oq) is the "predetermined confusion data which defines confusability" for 
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anti-keywords {6i (a) } which handle confusibility among keywords (column 8, lines 33 to 
55: Figure 2). 

Regarding claims 2 and 49, Chou et ai discloses the confidence measure is 
generated based upon data stored in verification database 16 (column 4, lines 34 to 51 : 
Figure 1). 

Regarding claims 3 and 50, Chou etal. discloses a word-based confidence score 
34 (column 8, lines 40 to 55: Figure 2); each word is a "label" in a string of words being 
recognized. 

Regarding claims 4 and 51 , Chou et ai discloses string models are generated in 
an "N" best list ("a list of alternatives") by N-best string model generator 46 (column 6, 
line 45 to column 7, line 53: Figure 2; column 9, line 54 to column 10, line 14). 

Regarding claims 5 and 52, Chou et ai discloses Viterbi alignment ("an aligner") 
of the input string, O, against the model sets for each given word string in the N-best 
string list (column 7, lines 8 to 15); average word-based confidence score processor 36 
("a combiner") performs mathematical averaging for each word segment signal of the 
hypothesis string to generate an average word-based confidence score signal ("said 
similarity measure") (column 5, lines 53 to 67: Figure 2). 

Regarding claims 21 and 68, Chou et ai discloses Viterbi alignment ("an aligner") 
of the input string, O, against the models sets for each given word string in the N-best 
string list (column 7, lines 8 to 15); Viterbi alignment is "a dynamic programming 
technique". 
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Regarding claims 22 to 25, 69 to 72, and 90, Chou et al. discloses Viterbi 
alignment ("an aligner") of the input string, O, against the model sets for each given 
word string in the N-best string list (column 7, lines 8 to 15); implicitly, Viterbi alignment 
determines "progressively a plurality of possible alignments", generates scores for each 
given word in the N-best list, determines "an optimum alignment", and "combines the 
scores" for each word in the word string. 

Regarding claims 30 to 32 and 77 to 79, Chou et al. discloses the input string is 
speech (column 4, lines 34 to 35: Figure 1), which is a time sequential audio signal of 
words. 

Regarding claims 37, 38, 84, and 85, Chou et al. discloses confidence score 
computation for a speech segment q relates a comparison between a word model score 
("said confidence data") and scores computed with the anti-word model ("said confusion 
data"); in Equation (2), L(O q ;Q,l) is "the measure of similarity" calculated by the similarity 
measure calculator, g,(O q ) = log p(O q \9f k) ) is "the confidence data" for the keyword 
hypothesis {0i (k) }, and Gi(Oq) is "the confusion data" for anti-keywords {0i (a) } which 
handle confusibility among keywords (column 8, lines 33 to 55: Figure 2). 

Regarding claims 39 and 86, Chou et al. discloses an average confidence score 
based on upon the average of word-based confidence scores (column 5, lines 53 to 67); 
an average confidence score is a normalization from each of the word-based 
confidence scores. 

Regarding claim 73, Chou et al. discloses each of the words ("labels") in the 
unknown speech string ("each of the labels in said recognized sequence of labels") is 
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scored against recognition models ("stored sequences of labels") in the recognition 
database 12 (column 4, lines 34 to 51: Figure 1). 

Regarding claim 96, Chou et a/, discloses all of the ordered method steps. 

Claim Rejections - 35 USC § 103 

4. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

5. Claims 20, 35, 36, 46, 67, 82, 83, 93, 98, 100, and 102 are rejected under 35 
U.S.C. 103(a) as being unpatentable over Chou etal. in view of Arefetal. 

Concerning claims 20 and 67, Chou et ai omits an aligner operable to identify 
deletions and insertions. However, Arefet ai teaches an analogous art speech 
recognition system for correcting misspelled words in a string of text. (Column 1 , Lines 
32 to 52) Specifically, Arefet ai discloses detecting recognition errors as models from 
insertion errors and deletion errors. (Column 3, Lines 36 to 60) It is suggested that 
there are advantages to speed the search process and reduce the size of the database 
by correcting misrecognized or misspelled words with the search technique of Aref et ai 
(Column 1 , Lines 52 to 61 ) It would have been obvious to one having ordinary skill in 
the art to incorporate the insertion and deletion error technique of Aref et ai into the 
word-based confidence score method of Chou et ai for the purpose of correcting 
misrecognitions with a high speed search process and reduced database size. 



• ♦ 
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Concerning claims 35, 36, 82, and 83, Chou et aL omits mis-typing probabilities 
and mis-spelling probabilities based upon sub-word units. However, Arefetal. teaches 
an analogous art speech recognition system for correcting misspelled words in a string 
of text. (Column 1 , Lines 32 to 52) Specifically, Arefet aL discloses probabilities for 
letters being recognized incorrectly, where letters are sub-word units, to estimate a 
measure of similarity between two words. (Column 4, Lines 1 to 59) Recognition errors 
are based upon typing errors, e.g. "airnmail" is mistakenly inserted for the word 
"airmail". (Column 3, Lines 36 to 53) It is suggested that there are advantages to 
speed the search process and reduce the size of the database by correcting 
misrecognized or misspelled words with the search technique of Arefet aL (Column 1 , 
Lines 52 to 61 ) It would have been obvious to one having ordinary skill in the art to 
utilize the mis-typing and mis-spelling technique of sub-word units taught by Arefet aL 
into the word-based confidence score method of Chou et aL for the purpose of 
correcting misrecognitions with a high speed search process and reduced database 
size. 

Concerning claims 46, 93, 98, 100, and 102, Chou et aL omits an application of 
speech recognition to querying a database and obtaining information from the database, 
although this is a well known application for speech recognition systems, generally. 
However, Arefet aL teaches an analogous art speech recognition system for searching 
a database for recognized text by querying keywords. (Column 2, Lines 41 to 50) It is 
suggested that there are advantages to speed the search process and reduce the size 
of the database by correcting misrecognized or misspelled words with the search 
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technique of Arefetal. (Column 1, Lines 52 to 61) It would have been obvious to one 
having ordinary skill in the art to apply the word-based confidence score method of 
Chou et al. to a retrieval system from a database of automatically recognized text as 
taught by Arefet al. for the purpose of correcting misrecognitions with a high speed 
search process and reduced database size. 

6. Claims 33, 34, 80, and 81 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Chou et al. in view of Wheatley et ai 

Chou et al. discloses recognizing speech with word-based confidence scores, 
where the labels are words, but omits recognizing sub-word units and phonemes. 
However, it is a well known art recognized alternative in speech recognition to recognize 
phonemes, which are sub-word units, rather than words. Wheatley et al. teaches a 
related apparatus and method for speech recognition, where speech is recognized with 
Hidden Markov Models representing phonetic units instead of words. (Column 7, Lines 
14 to 37) It is suggested that there is an advantage of representing real world, 
unscripted conversations. (Column 2, Lines 28 to 39) It would have been obvious to 
utilize sub-word phonetic units for the speech recognition system of Chou et al. as 
suggested by Wheatley et al. for the purpose of better recognizing real world, unscripted 
conversations. 
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Allowable Subject Matter 



7. Claims 6 to 19, 26 to 29, 40 to 45, 47, 53 to 66, 74 to 76, 87 to 89, 91 to 92, and 
94 to 95 are objected to as being dependent upon a rejected base claim, but would be 
allowable if rewritten in independent form including all of the limitations of the base 
claim and any intervening claims. 

Response to Arguments 

8. Applicants' Preliminary Amendments filed 04 February 2004 and 23 February 
2004 present broadened claims requiring new grounds of rejection. 



9. The prior art made of record and not relied upon is considered pertinent to 
Applicants' disclosure. 

Hon et al. and Visser disclose related art. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Martin Lerner whose telephone number is (703) 308- 
9064. The examiner can normally be reached on 8:30 AM to 6:00 PM Monday to 
Thursday. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Richemond Dorvil can be reached on (703) 305-9645. The fax phone 



Conclusion 
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number for the organization where this application or proceeding is assigned is 703- 
872-9306. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 
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Martin Lerner 
Examiner 
Art Unit 2654 



